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Blind  Equalisation  With  the  Lattice  Constant  Modulus  Algorithm 

R.A.  Axford,  Jr. 

Naval  Command,  Control  and  Ocean  Surveillance  Center 
RDTE  Division,  Code  844,  San  Diego,  CA  92152-5733 


Abstract  This  paper  presents  an  evaluation  of  the  performance 
of  the  lattice  constant  modulus  algorithm  (LCMA)  in  blind 
channel  equalization.  The  convergence  performance  of  LCMA 
is  compared  to  that  of  its  transversal  counterpart  in  equalizing 
the  distortion  of  four  progressively  more  stressing  finite 
impulse  response  channels  for  8-PSK  and  16'QAM  signals. 
The  results  indicate  that  while  the  convergence  behavior  of 
both  algorithms  depends  strongly  on  the  transmitted 
constellation,  LCMA  exhibits  superior  performance  for  16- 
QAM  as  the  spectral  dynamic  range  of  the  chatmel  increases. 

1  Introduction 

In  traditional  adaptive  channel  equalization  schemes,  training 
sequences  of  known  data  are  periodically  sent  by  the 
transmitter  to  allow  adjustment  of  the  receiver's  equaiizer 
coeCGcients.  Blind  equalization  techniques  accomplish  the 
adjustment  of  adaptive  equalizer  coefficients  without  the  use  of 
training  sequences.  The  majority  of  blind  equalization  schemes 
can  be  categorized  as  either  Bussgang  techniques  or 
polyspectra-based  techniques  [1].  Members  of  the  Bussgang 
family  are  typically  similar  to  the  LMS  algorithm  in  that 
stochastic  gradient  descent  techniques  are  used  to  minimize  a 
cost  function  that  depends  on  both  the  output  of  the  equalizer 
and  some  (known  or  assumed)  statistical  description  of  the 
transmitted  signal.  The  most  widely  utilized  of  the  Bussgang 
techniques  for  blind  equalization  is  the  constant  modulus 
algorithm  (CMA)  [2],  [3].  CMA  is  based  on  the  recogrution 
that  multipath  distortion  and  additive  interference  cause 
amplitude  fluctuations  in  a  signal  that,  in  the  absence  of  these 
impairments,  would  otherwise  exhibit  a  constant  amplitude  (at 
the  slicing  instants).  CMA  is  also  applicable  to  signals  with 
multiple  amplitudes,  such  as  A/-ary  QAM,  in  which  the 
"constant  modulus*  becomes  a  quantity  related  to  the  average 
amplitude  of  the  constellation.  In  this  paper  we  will  compare 
transversal  and  lattice  filter  implementations  of  the  "2-2* 
variant  of  CMA  [3],  each  based  on  stochastic  gradient 
minimization  of  the  phase-blind  cost  function 


where  d(R)  is  the  equalizer's  estimate  of  the  transmitted 
symbol,  and  the  expected  values  in  the  definition  of  the 
constant  /Ij  are  taken  over  all  symbols  in  the 

transmitted  constellation. 

From  (1)  we  note  that  J  depends  on  the  transmitted 
constellation  through  the  constant  R^.  However,  the  use  of  an 


alternate  value,  say  only  results  in  a  scaling  of  the 
amplitude  of  a(n)  by  While  the  appropriate  selection  of /?2 

is  important  for  smooth  switching  between  CMA  and  the 
decision-directed  mode  (4],  we  see  that,  except  for  this  scaling, 
the  CMA  cost  function  is  independent  of  the  transmitted 
constellation. 

It  is  well  known  that  CMA  converges  more  slowly  and  exhibits 
greater  residual  mean-square  estimation  error  after  convergence 
for  QAM  than  for  PSK  signals.  These  results  are  not 
surprising  if  we  consider  the  error  function  component  of  the 
CMA  stochastic  gradient  weight  update: 

c(n)  =  a(n)(|fl(«)f  -R^).  (2) 

For  QAM,  e(n)  is  non-zero  for  perfect  symbol  estimates  while 
for  PSK  it  is  zero  for  perfect  symbol  estinutes. 

It  is  possible  to  accelerate  blind  equalizer  convergence  for 
QAM  signals  by  employing  variations  of  CMA  that  take 
advantage  of  knowledge  of  the  transmitted  constellation  [S], 
[6].  Here,  however,  we  will  study  the  performance  of  a  CMA- 
convergence-accelerating  technique  for  which  knowledge  of  the 
transmitted  constellation  is  unnecessary,  namely  the  lattice 
constant  modulus  algorithm  (LCMA)  [7].  The  methods  used  in 
this  paper  are  based  on  the  early  stochastic  gradient  lattice 
(SQL)  equalizer  work  of  Satorius  and  Alexander  in  [8],  and  on 
the  more  recent  comparative  evaluation  of  several  blind 
equalization  algorithms  by  Sbynk  et  al.  in  [9]. 

2  The  Transversal  and  Lattice  Constant  Modulus 
Algorithms 

This  section  defines  the  two  blind  equalization  algorithms 
considered  in  the  paper.  In  all  that  follows,  use  is  made  of  the 
notatiorul  conventions  for  complex-valued  random  variables 
contained  in  [10]. 

2.1  Transversal  CMA  (TCMA) 

The  coefficient  update  formula  of  TCMA  with  a  normalized 
step-size  can  be  expressed  as 

w(n  + 1)  =  w(n)  -  .^-*(n)fl'(n){|5(R)f  -  Rj^in)  (3) 
where  *  indicates  complex  conjugate, 

u(n)  =  [u(n)  u{n-l)  ...  u(n-L)f  (4) 

is  the  vector  of  samples  of  the  received  waveform  u(n), 

w(«)  =  K(n)  w,(n)  ...  w^(n)]'^  (5) 

is  the  vector  of  variable  tap-weights  at  time  n, 


«(ii)  =  w*(»i)u(fi)  =  5^iv;(«)«(n-/)  (6) 

is  the  oalput  of  the  L-f  1  tap  equalizer  at  time  n, 

P»  =  (l-a,)A(«-l)  +  u''(«)u(«)  (7) 

is  a  ninning  estimate  of  the  power  in  the  equalizer,  and 

0  <  a,  <  1  (8) 

is  the  normalized  step-size  parameter.  The  estimate  defined  in 
(7)  is  biased  and  over-estimates  the  power  in  the  equalizer  by 
Of*.  The  appropriate  initialization  of  (7)  is 

P.(0)  =  (L  +  l)P.a;*  (9) 

where  is  the  true  power  in  u(n).  Reducing  <2^  decreases 
both  the  step  size  and  the  asymptotic  MSE  of  the  algorithm. 
Increasing  Uj-  allows  the  algorithm  to  converge  mote  quickly 
at  the  expense  of  increasing  the  asymptotic  MSE. 

2  J  Lattice  CMA  (LCMA) 

The  structure  of  the  LCMA  equalizer  is  shown  in  figure  1.  The 
lattice  order  recursion  update  equations  are  given  by 

/r(«)  =  /r-.(«)-^rWVt(«-l)  (W) 

^(n)  =  (11) 

where  X^(n)  is  the  f'* -stage  PARCOR  coefficient,  and  /,(rt) 
and  b((it)  are  the  f'*-order  forward  and  backward  prediction- 
errors,  respectively,  all  at  time  n.  Note  that/g(R)  =  bf/(n)  =  u(n). 
The  f '^-stage  PARCOR  coefficient  is  updated  according  to 

X,(n-H)  =  JC,(n)+^(n)[b;..(R-l)/,(n)-^ /,.,(R)b;(R)]  (12) 
where 

4r(«)  =  (1  -  ar  - 1) + [|/r-.(«)r  + 1*/-.  («  "  1)1']  (13) 

is  a  biased  estimate  of  the  total  prediction-error  power  entering 
the  f^-stage,  and  0<a^<l  is  a  normalized  step  size 
parameter.  The  update  (12)  accomplishes  a  method-of- 
steepest-descent  minimization  of  the  cost  function 

V=|(^[l/»l1+^[l'’r('»)f]).  (14) 

The  purpose  of  the  lattice  structure  in  the  context  of  charmel 
equalization  is  to  transform  the  correlated  input  sequence 
u(n),u(n-l),  ...,u(n-L).  into  the  uncorrelated  sequence 
bg(n),  b^(it),...,bi^(n)  of  backward  prediction-errors.  (It  can 
be  shown  that  the  infornution  content  of  both  sequences  is 
identical.  Sec  section  4.16  of  [10].)  The  significance  of  this 
when  training  sequences  arc  used  can  be  appreciated  by 
viewing  the  lattice  equalizer  as  a  transversal  filter  with  tap- 
weights  (joint  process  estimation  coefficients) 

g(«)  =  [&(")  ft(«)  •••  «i(«)r.  (15) 

"pre-conditioned”  input  vector 

b(n)  =  [bo(n)  b,(n)  ...  b^(/i)f,  (16) 

and  output 

a(«)  =  g''(«)b(n)  =  5^g;(n)b,(«).  (17) 


When  training  sequences  are  used  and  the  cost  function  is 
simply  the  mean-squared-error  between  the  equalizer  ou^t 
and  the  transmitted  symbol  a(ii) 

•^j«c=l?[|«(«)-«(«)r].  (18) 

the  resulting  error  function  component  (e(n)  -  a(n)-  a(/t))  of 
the  stochastic  gradient  weight  update  is  linear.  Furthermore, 
after  the  PARCX^R  coefficients  have  converged,  the 
autocorrelation  matrix  of  b(n) 

D4£[b(/i)b«(«)]  (19) 

is  diagonal  and  given  by 

D  =  diag[£[|b,(4]  £[|b.(R)f]  ...  £[|b,(R)f]].  (20) 

Thus  the  eigenvalues  of  D  consist  of  the  backward  prediction- 
error  powers,  orders  0  through  L.  Typically,  the  dynamic 
range  of  these  powers  is  less,  often  much  less,  than  the 
dynamic  range  of  the  eigenvalues  of  the  correlation  matrix  of 
the  received  waveform  samples 

R4£[u(n)u"(n)].  (21) 

As  a  result,  the  quadratic  performance  surface  of  the  trained 
stochastic  gradient  lattice  (SGL)  equalizer  in  the  space  of  the 
tap  weights  g(n)  will  be  considerably  less  distorted  than  that  of 
the  corresponding  LMS  transversal  equalizer  in  the  space  of  the 
tap  weights  w(r)  when  the  spectral  dynamic  range  of  the 
received  signal  u(r)  is  large.  Hic  linearity  of  the  "algorithm¬ 
driving*  error  function,  combined  with  the  fact  that  D  is 
diagonal,  allows  a  decoupling  of  the  update  equations  for  the 
elements  of  g(n)  in  the  trained  SGL  equalizer  to  the  extent  that 


Figure  1  Lattice  CMA  Equalizer;  (a)  single  lattice  stage; 
(b)  (Z.-f  l)-tap  lattice  equalizer  using  cascaded  stages. 


(i)  a  scpaiate  normalized  step  size,  and  ((i)  an  'order-locar 
error  function  v^(n)  =  a,(n)-a(n)  may  be  used  in  updating 
each  gf(n)  [8].  (a,(/i)  is  the  /‘‘-order  joint  process  estimate  of 
a(n).) 

In  LCMA,  since  the  error  function  component  of  the  weight 
update  is  nonlinear,  order-local  error  terms  cannot  be  used. 
The  update  equations  for  the  elements  of  g(n)  remain  coupled 
through  their  joint  use  of  e(n)  defined  in  eq.  (2). 
Consequently,  some  of  the  "modularity*  of  the  trained  SGL 
equalizer  is  lost  in  LCMA.  Nonetheless,  separate  normalized 
step  sizes  can  still  be  used  for  each  element  of  g(n)  in  LCMA. 

The  /“*  variable  tap-weight  g,(n)  of  the  LCMA  equalizer  is 
updated  according  to 

&(/i  +  l)  =  g,(n)-.^/(«)a’(n)(|a(/i)f  (22) 

where 

P*,(n)  =  (l-aj4(n-l)-t-|6,(«)f  (23) 

is  a  biased  estimate  of  the  /‘‘-order  backward  prediction-error 
power,  and  0<a^  <1  is  the  normalized  step  size  parameter. 
Varying  has  the  same  general  effects  on  the  convergence 
time  and  asymptotic  MSE  of  LCMA  as  varying  Oj.  does  on  the 
performance  of  TCMA. 

13  Blind  Equalizer  Coefficient  Initialization 
In  [2],  Godard  argued  that  undesirable  local  minima  of  the  cost 
function  J  in  the  space  of  the  variable  tap  weights  can  be 
avoided  by  using  what  has  come  to  be  called  the  "spike 
initialization".  This  consists  of  setting  the  initial  value  of  all 
variable  tap-weights  to  zero  with  the  exception  of  a  single 
weight  at  or  near  the  center  of  the  equalizer.  According  to 
Godard,  the  initial  value  of  this  "reference"  tap-weight  should 
be  set  to  a  value  larger  than  a  constant  that  depends  on 
moments  of  the  transmitted  constellation  and  "the  channel 
impulse  response  sample  having  the  largest  magnitude*  (see  eq. 
(38)  of  (2]).  The  universal  validity  of  this  assertion  has 
subsequently  been  disproven  [11],  [12].  The  search  for  a  blind 
equalizer  that  will  converge  to  a  global  minimum  regardless  of 
the  initial  values  of  the  variable  tap-weights  is  a  topic  that  is 
beyond  the  scope  of  this  paper.  It  is  sufficient  for  our  purposes 
to  note  that  wnile  there  are  situations  of  significant  practical 
interest  for  which  CMA  fails  to  converge  from  the  "spike 
initialization",  there  are  also  situations  of  significant  practical 
interest  for  which  CMA  docs  in  fact  successfully  converge 
using  this  initialization. 

In  all  simulations  that  follow,  we  set  L-t-l=ll  taps,  and  the 
initial  tap-weight  vectors  are  always  set  to 

w(0)  =  [0  0  0  0  0  1  0  0  0  0  0]'^  (24) 

for  TCMA,  and 

g(0)  =  [0  0  0  0  0  1  0  0  0  0  0]',  (25) 


/f,(0)  =  0,  /  =  1,2,...,10  (26) 

for  LCMA.  For  both  algorithms,  this  corresponds  to  an  initial 
all-pass-fillCT  equalizer  transfer  function  with  a  group  delay  di 
5. 

3.  Simulation  Configuration 

A  schematic  diagram  of  the  computer  simulations  is  shown  in 
figure  2.  The  transmitted  signal  a(n)  consists  of  an 
uncorrclated  sequence  of  either  8-PSK  symbols  with  unit 
amplitude,  or  16-QAM  symbols  with  real  and  imaginary 
components  drawn  from  {±1,  ±3}.  The  appropriate  values  of 
/?2  from  eq.  (1)  arc  1  and  13.2,  respectively. 


Error 

Figure  2  Computer  simulation  configuration. 


The  impulse  response  h(n)  of  the  charmcl  is  given  by 


h(n)  =  h,S(n-l)+h^S(n-2)  +  hjS(n-3)  (27) 

where 


h,= 


r,  (2jr 

1+cos  - 

(  W 


2^1 

0,  otherwise 


(*-2)] 


.  *  =  1,2,3 


and 


h  =  [/ii  k, 


(28) 

(29) 


Thus,  the  charmel's  impulse  response  is  normalized  such  that 
+h^+h^  =  1  for  all  values  of  the  bandwidth  parameter  W. 
Accept  for  this  normalization,  the  charmcl  model  is  the  same 
used  by  Satorius  and  Alexander  in  [8].  Complex  additive  white 
Gaussian  noise  v(n)  is  added  to  the  output  of  the  channel  to 
form  the  received  waveform  u(/t).  The  real  and  imaginary 
components  of  v(n)  are  independent,  zero-mean  Gaussian 
random  processes,  each  with  variance  0.001.  The  transmitted 
sequence  is  delayed  by  the  combined  group  delay  of  the  charmel 
and  equalizer  (7  samples),  and  compared  to  the  estimates 
produced  by  the  equalizer.  The  estimation  error  e^(n)  is 
formed  as  shown  in  figure  2.  Given  that  the  charmcl  impulse 
response  samples  arc  real-valued  and  that  perfect  carrier 
recovery  is  assumed,  a  decision-directed  phase  locked  loop  is 
not  used.  Nonetheless,  the  tap-weights  in  w(/i)  and  g(n)  arc 
rotated  at  each  iteration  such  that  the  imaginary  component  of 
the  reference  tap  (/  =  5)  is  kept  equal  to  zero.  This  was  found 
to  be  necessary  in  order  to  prevent  d{n)  from  spitming  slightly 
for  the  largest  values  of  Oj. 


with 


A.  Results 

Figure  3  slums  four  LCMA  learning  curves,  one  corresponding 
to  each  value  of  W  as  shown,  for  the  16'QAM  signal  and 
a^  =0.0001.  Each  curve  was  obtained  by  ensemble-averaging 
|e^  («)|*  over  10  independent  trials.  The  ensemble-averaged 
curves  were  further  smoothed  with  a  SO-point  moving-average 
filter  as  was  done  in  [9].  Figure  4  shows  three  LCMA  learning 
curves  for  W=2.9  and  the  values  of  indicated  on  the  plot. 
Note  that  while  decreasing  increases  the  convergence  time, 
it  also  reduces  the  asymptotic  MSE  e],  . 


F^re  3  LCMA  learning  curves  for  four  values  of  W. 


In  order  to  compare  the  convergence  performance  of  LCMA 
and  TCMA  independently  of  the  values  of  the  step-size 
parameters,  several  smoothed  learning  curves  were  produced 
for  each  algorithm  using  various  values  of  and  .  From 
these  curves,  the  asymptotic  MSE  si  and  the  number  of 
iterations  required  to  reach  el  were  determined  and  plotted 
producing  convergence  performance  curves.  As  noted  in  [9], 
when  compring  two  adaptive  equalization  algorithms  in  this 
way,  if  one  algorithm's  prformance  curves  always  lie  below 
those  of  another,  then  the  former  algorithm  clearly  has  suprior 
convergence  prformance. 


Figure  5  shows  the  LCMA  (solid)  and  TCMA  (dotted) 
convergence  performance  curves  for  4  values  of  W  and  both 
modulations.  These  prformance  curves  sumiiurize  the 
information  gathered  from  a  total  of  134  smoothed  learning 
curves.  The  left-hand  (right-hand)  extreme  of  any  prformance 
curve  correspnds  to  the  largest  (smallest)  step  size  used. 
Horizontal  dashed  lines  indicate  the  MSE  levels  achieved  by  an 
11 -tap  Wiener  equalizer  (cost  function  defined  in  eq.  (18))  for 
8-PSK  (unit  amplitude)  and  16-QAM  ({±1, ±3})  when  the 
thermal  noise  variance  is  0.001.  Note  that  as  W  increases,  the 
minimum  MSE  for  this  16-QAM  constellation  increases  more 
rapidly  than  does  that  of  the  8-PSK  signal.  Thus  2  horizontal 
dashed  lines  appear  for  W2  3.3,  the  lower  of  which 
correspnds  to  8-PSK  in  figures  5  (c)  and  (d). 

For  both  algorithms,  the  transmitted  constellation  is  a  stronger 
factor  in  determining  the  convergence  prformance  than  is  the 
spctral  dynamic  range  of  the  channel  (at  least  for  the 
constellations  and  channel  prameteis  used  here).  Both 
algorithms  always  converge  more  rapidly  for  8-PSK  than  for 
16-QAM  when  chaimel  conditions  arc  equal.  Furthermore, 
both  algorithms  converge  more  quickly  for  8-PSK  under  the 
most  stressing  charmel  conditions  used  in  these  simulations 
(W=3.5)  than  for  16-QAM  under  the  least  stressing  chaiuicl 
conditions  (W=2.9). 

For  8-PSK,  these  simulation  results  indicate  that  the 
convergence  prformance  of  LCMA  is  actually  slightly  inferior 
to  that  of  TCMA  until  the  eigenvalue  ratio  of  R  reaches 
roughly  16  dB.  However,  for  both  modulations,  LCMA  is  less 
sensitive  than  TCMA  to  increases  in  the  spctral  dynamic 
range  of  u(n). 

For  16-QAM,  the  convergence  prformance  of  LCMA  becomes 
suprior  to  that  of  TCMA  as  the  eigenvalue  ratio  of  R  increases 
above  roughly  13  dB.  Thus  the  stochastic  gradient  lattice  can 
indeed  accelerate  the  convergence  of  a  CMA  blind  equalizer  for 
amplitude  modulated  signals. 

5  Summary 

This  ppr  has  presented  the  first  evaluation  of  the  convergence 
prformance  of  the  lattice  constant  modulus  algorithm  in  blind 
charmel  equalization.  The  appropriate  ’spike  initialization*  of 
the  LCMA  equalizer  has  been  demonstrated.  The  clear 
convergence  rate  supriority  of  trained  SGL  equalizers  over 
trained  transversal  LMS  equalizers  is  not  always  observed  in 
the  corresponding  *2-2’  CMA  versions  of  these  Glters.  The 
convergence  prformance  of  both  lattice  and  transversal  CMA 
equalizers  depnds  strongly  on  the  transmitted  constellation. 
However,  as  the  spctral  dynamic  range  of  the  received  signal 
increases,  the  convergence  prformance  of  LCMA  is  degraded 
to  a  lesser  degree  than  that  of  TCMA.  Further  work  is  requited 
to  fully  understand  effects  of  the  spctral  dynamic  range  of  the 
received  signal  on  the  CMA  prformance  surface. 


Figure  5  Conveigencc  performance  curves  for  LCMA  (solid)  and  TCMA  (dotted).  Horizontal  dashed  lines  indicate  Wiener  MSEs. 
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